BIOTECHNOLOGY ^0 _g 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: .■■XQfhlo^^fofp 

Source: ^QjQ J 

Date Processed by STIC:. ^ ^ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 

TRADEMARK OFFICE WEBSITE SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom 

Any reply including a sequence listing in electronic form should NOT be sent to die 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1 EFS-Bio (<http://www.usnto,gov/cbc/cfs/downloads/docu merits. htm> , EFS Submission 
User Manual - cPAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 06/05/04): 

U.S. PateiU and Tradeniark Omce, 220 20 lh Street S:, Customer Window; Mail Stop Sequence, Crystal Plaza TworLobby, 
Room 1B03, Arlington, VA 22202 



Revised 05/17/04 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER:. 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA" HEADER, WINCH WERE INSERTED BY PTO SOFTWARE 



I 



_Wrapped Nuclcics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was retrieved in a word pr6cessor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 

Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

Misaligned Amino The numbering under each 5* amino acid is misaligned. Do not use lab codes between numbers: 
Numbering use space characters, instead. 

Non-ASCII The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules Please 

ensure your subsequent submission Is saved in ASCII text. 

Variable Length Sequcnce(s) contain n's or Xaa's representing more than one ysiduc. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing 

Patcntln 2.0 A "bug" in Patcntln version 2.0 has caused the <220>-<223> section to be missing from ammo acid 

"bug" sequences(s) . Normally, Patcntln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies (o (he mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



Skipped Sequences 
(OLD RULES) 



Scqucncc(s) _____ missing. If intentional, please insert the following lines for each skipped sequence 
(2) INFORMATION FOR SCQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not inscn any subheadings under this heading) 

(xi) SEQUENCE DESCRIPT10N:SEQ ID NO X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped ■•' 7" 

Please also adjust the "(n) NUMBER OF SEQUENCES :" response to include the skipped sequences 



Skipped Sequences 
(NEW RULES) 



Sequcnce(s) 



<2I0> sequence id number 
<400> sequence* id number 
000 



missing. If intentional, please insert the following lines for each skipped sequence 



Use of n's or Xaa's 
(NEW RULES) 



10_ Invalid <2I3> 

Response 



1 ^^Usc o 



f <220> 



12 Patentln 2.0 

"bug" 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's arc present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents 

Per 1.823 of Sequence Rules, the only valid <2I3> responses are: Unknown, Artificial Sequence, or 
scientific name (GcnusApcctcs) <220>-<223> section is required when <2I3> response is Unknown or 
is Artificial Sequence 

MS 



Scqucncc(s) 1 missing the <220> "Feature** and associated numeric identifiers and rcspunses 

Use of <220> to <223> is MANDATORY if <213> "Organism" responsris "Artificial Sequence" or 
"Unknown." [Please explain source of yenctic material in <?20> to <223 > section. / 
(Sec "Federal Register," 00701/1998, Vol. 63, No. 104, pp. 29<1JJ C J!1) {M. 1 



T3cTn-frm>f Sequence Rules) 



Please do not use "Copy to Disk" function of Patcntln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse of n/Xaa V can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 
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RAW SEQUENCE LISTING DATE: 07/02/2004 

PATENT APPLICATION: US/10/667 , 966 TIME: 11:51:28 

Input -Set : A:\Untitled.ST25.txt 

Output Set: N:\CRF4\07022004\J667966.raw 

3 <110> APPLICANT: Xie, Dong 

4 Jiang, He 

6 <120> TITLE OF INVENTION: Peptide Derivative Fusion Inhibitors of HIV Infection 

8 <130> FILE REFERENCE: 63024.000002 
C--> 10 <140> CURRENT APPLICATION NUMBER: US/10/667 , 966 
C--> 10 <141> CURRENT FILING DATE: 2003-09-23 

10 <150> PRIOR APPLICATION NUMBER: 60/412,797 

11 <151> PRIOR FILING DATE: 2002-09-24 
13 <160> NUMBER OF SEQ ID NOS : 15 

15 <170> SOFTWARE: Patentln version 3.1 . p A nmiw 

17 <2io> seq id no: i Does Not oompiy . 

18 <211> LENGTH : 44 CO^ted DiSte«0 HQQ^ 

19 <212> TYPE: PRT ' 

20 <213> ORGANISM: Artificial Sequence 

22 <220> FEATURE: ^ r Jf jj o0 6 rroT 

23 <223> OTHER INFORMATION: peptide sequence J ^gC ' 7^"^ ' J^^smA 
25 <400> SEQUENCE: 1 tt^VsT**** f ^TfOrc^J 

2 7 Ser Leu Glu Gin lie Trp Asn Asn Met Thr Trp Glu Glu Trp Asp Arg 
28 1 5 10 15 

31 Glu lie Asn Asn Tyr Thr Glu Leu lie His Glu Leu lie Glu Glu Ser 

32 20 , 25 30 

3 5 Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu 
36 35 40 

39 <210> SEQ ID NO: 2 

40 <211> LENGTH: 34 

41 <212> TYPE: PRT 

42 <213> ORGANISM: Artificial Sequence 

44 <220> FEATURE: 

45 <223> OTHER INFORMATION: ^P^ptide sequence^) 
47 <400> SEQUENCE: 2 ! 

49 Trp Glu Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Lys Leu lie His 

50 1 5 10 15 

53 Glu Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 

54 20 25 30 
57 Leu Leu 

61 <210> SEQ ID- NO: 3 

62 <211> LENGTH: 3 9 

63 <212> TYPE: PRT 

64 <213> ORGANISM: Artificial Sequence 
66 <220> FEATURE: 



67 <223> OTHER INFORMATION: ( Peptide sequence^) 
69 <4 00> SEQUENCE: 3 "! 

71 Trp Gin Glu Trp Glu Gin Lys lie Thr Ala Leu Leu Glu Gin Ala Gin 



file://C:\CRF4\OUTHOLD\VsrJ667966.htm 
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RAW SEQUENCE LISTING DATE: 07/02/2004 

PATENT APPLICATION: US/10/667 , 966 TIME: 11:51:28 

Input Set : A:\Untitled.ST25.txt 

Output Set: N:\CRF4\07022004\J667966.raw 

72 1 5 10 15 

75 lie Gin Gin Glu Lys Asn Glu Tyr Glu Leu Gin Lys Leu Asp Lys Trp 

76 20 25 ■ 30 
7 9 Ala Ser Leu Trp Glu Trp Phe 

80 35 

83 <210> SEQ ID NO: 4 

84 <211> LENGTH: 36 

85 <212> TYPE: PRT 

86 <213> ORGANISM: Artificial Sequence 

88 <220> FEATURE: 

89 <223> OTHER INFORMATION: ^Peptide 
91 <400> SEQUENCE: 4 

93 Tyr Thr Ser Leu lie His Ser Leu lie Glu Glu Ser Gin Asn Gin Gin 

94 1 5 10 15 

97 Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu 

98 20 25 30 

101 Trp Asn Trp Phe 

102 35 

105 <210> SEQ ID NO: 5 

106 <211> LENGTH: 34 

107 <212> TYPE: PRT 

108 <213> ORGANISM: Artificial Sequence 

110 <220> FEATURE: ^ 

111 <22 3> OTHER INFORMATION : ( ^Peptide sequencg> ^O^VvE^ ^J^TQ/T 
113 <400> SEQUENCE: 5 

115 Trp Met Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Ser Leu lie His 

116 15 10 15 

119 Ser Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 

120 20 25 30 
123 Leu Leu 

127 <210> SEQ ID NO: 6 

128 <211> LENGTH: 34 

12 9 <212> TYPE: PRT 

130 <213> ORGANISM: Artificial Sequence 

132 <220> FEATURE: ^ 

133 <223> OTHER INFORMATION ^Peptid e sequence^ 
135 <400> SEQUENCE: 6 

13 7 Trp Gin Glu Trp Glu Arg Lys Val Asp Phe Leu Glu Glu Asn lie Thr 
138 1 5 10 15 

141 Ala Leu Leu Glu Glu Ala Gin lie Gin Gin Glu Lys Asn Met Tyr Glu 

142 20 . 25 30 
145 Leu Gin 

149 <210> SEQ ID NO: 7 

150 <211> LENGTH: 34 

151 <212> TYPE: PRT 

152 <213> ORGANISM: Artificial Sequence 
154 <22 0> FEATURE: 



155 <223> OTHER INFORMATION : /Peptide sequence^) 
157 <400> SEQUENCE: 7 ^ — 
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RAW SEQUENCE LISTING DATE: 07/02/2004 

PATENT APPLICATION: US/10/667,966 TIME: 11:51:28 

Input Set : A:\Untitled.ST25.txt 

Output Set: N:\CRF4\07022004\J667966.raw 

159 Trp Glu Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Lys Leu lie His 

160 1 5 10 15 

163 Glu Leu He Glu Glu Ser Gin Asn Gin Gin Glu Glu Asn Glu Gin Glu 

164 20 25 30 
167 Leu Leu 

171 <210> SEQ ID NO: 8 

172 <211> LENGTH: 44 

173 <212> TYPE: PRT 

174 <213> ORGANISM: Artificial Sequence 

176 <220> FEATURE: ^ 

177 <223> OTHER INFORMATION: /Peptide sequence^) 

179 <220> FEATURE: ^ 

180 <221> NAME/KEY: MISC_FEATURE 

181 <222> LOCATION: (23) . . (23) 

182 <223> OTHER INFORMATION: Xaa represents a Lysine residue derivatized with a maleimide 

moie 

183 ty. 

186 <400> SEQUENCE : 8 

188 Ser Leu Glu Gin He Trp Asn Asn Met Thr Trp Glu Glu Trp Asp Arg 

189 15 10 15 
W--> 192 Glu lie Asn Asn Tyr Thr Xaa Leu lie His Glu Leu He Glu Glu Ser 

193 20 25 30 

196 Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu 

197 35 40 

200 <210> SEQ ID NO: 9 

201 <211> LENGTH: 45 
2 02 <212> TYPE: PRT 

203 <213> ORGANISM: Artificial Sequence 

205 <220> FEATURE: ^ ^ _ 

206 <223> OTHER INFORMATION :MPep tide sequence) "So^TnS^ €LS" 
2 08 <22 0> FEATURE: : 

209 <221> NAME /KEY : M I SC_FEATURE 

210 <222> LOCATION: (45) . . (45) 

211 <223> OTHER INFORMATION: Xaa represents a Lysine residue derivatized with a maleimide 

moie 

212 ty. 

215 <400> SEQUENCE: 9 

217 Ser Leu. Glu Gin lie Trp Asn Asn Met Thr Trp Glu Glu Trp Asp Arg 

218 15 10 15 

221 Glu lie Asn Asn Tyr Thr Glu Leu He His Glu Leu He Glu Glu Ser 

222 20 25 30 
W--> 225 Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Xaa 

226 35 40 45 

229 <210> SEQ ID NO: 10 

230 <211> LENGTH: 34 

231 <212> TYPE: PRT 

232 <213> ORGANISM: Artificial Sequence 

234 <220> FEATURE: ^ ^ 

235 <223> OTHER INFORMATION: (Peptide sequence ^ 

237 <220> FEATURE: 

238 <221> NAME/KEY: MI SC_FEATURE 
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IE: ^—S— 

INFORMATION :(Peptide sequenc^ 7 



RAW SEQUENCE LISTING DATE: 07/02/2004 

PATENT APPLICATION: US/10/667 , 966 TIME: 11:51:28 

Input Set : A:\Untitled.ST25.txt 

Output Set: N:\CRF4\07022004\J667966.raw 

239 <222> LOCATION: (13) . . (13) 

240 <223> OTHER INFORMATION: Xaa represents a Lysine residue derivatized with a maleimide 

moie 

241 ty. 

244 <400> SEQUENCE: 10 

W--> 246 Trp Glu Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Xaa Leu lie His 

247 1 5 10 15 

250 Glu Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Trp Glu 

251 20 25 30 
254 Leu Leu 

258 <210> SEQ ID NO: 11 

259 <211> LENGTH: 35 

260 <212> TYPE: PRT 

261 <213> ORGANISM: Artificial Sequence 

263 <220> FEATURE: 

264 <223> OTHER 

266 <220> FEATURE: 

267 <221> NAME / KEY : MI SC_FEATURE 

268 <222> LOCATION: (35) . . (35) 

269 <223> OTHER INFORMATION: Xaa represents a Lysine residue derivatized with a maleimide 

moie 

270 ty. 

273 <400> SEQUENCE: 11 

275 Trp Glu Glu Trp Asp Arg, Glu lie Asn Asn Tyr Thr Glu Leu lie His 

276 1 5 10 15 

279 Glu Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu Gin Glu 

280 20 25 30 
W--> 283 Leu Leu Xaa 

284 35 

287 <210> SEQ ID NO: 12 

288 <211> LENGTH: 39 

289 <212> TYPE: PRT 

290 <213> ORGANISM:. Artificial Sequence 

292 <220> FEATURE: — . ^ f 

293 <223> OTHER INFORMATION: ( £epETde sequenc^ ^^^^^C CXC£>t 

295 <220> FEATURE: 

296 <221> NAME /KEY: MISC_FEATURE 

297 <2 22 > LOCATION: (13).. (13) 

298 <223> OTHER INFORMATION: Xaa represents a Lysine residue derivatized with a maleimide 

moie 

299 ty. 

3 02 <400> SEQUENCE: 12 

W--> 304 Trp Gin Glu Trp Glu Gin Lys lie Thr Ala Leu Leu Xaa Gin Ala Gin 

305 1 5 10 15 

308 He Gin Gin Glu Lys Asn Glu Tyr Glu Leu Gin Lys Leu Asp Lys Trp 

309 20 25 30 

312 Ala Ser Leu Trp Glu Trp Phe 

313 35 

316 <210> SEQ ID NO: 13 

317 <211> LENGTH: 40 

318 <212> TYPE: PRT 

319 <213> ORGANISM: Artificial Sequence 
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RAW SEQUENCE LISTING ( DATI£ : 07/02/2004 

PATENT APPLICATION: US/10/667 , 966 TIME: 11:51:28 

Input Set : A:\Untitled.ST25.txt 

Output Set: N:\CRF4\07022004\J667966.raw * 

321 <220> FEATURE: 

322 <223> OTHER INFORMATION : peptide sequence^ 

324 <220> FEATURE: 

325 <221> NAME /KEY : MISC_FEATURE 

326 <222> LOCATION: (40) . . (40) 

327 <223> OTHER INFORMATION: Xaa represents a Lysine residue derivatized with a maleimide 

moie 

328 ty. 

331 <400> SEQUENCE: 13 

33 3 Trp Gin Glu Trp Glu Gin Lys lie Thr Ala Leu He Glu Gin Ala Gin 
334 15 10 15 

33 7 He Gin Gin Glu Lys Asn Glu Tyr Glu Leu Gin Lys Leu Asp Lys Trp 
338 20 25 30 

W--> 341 Ala Ser Leu Trp Glu Trp Phe Xaa 
342 35 40 

345 <210> SEQ ID NO: 14 

346 <211> LENGTH: 34 

347 <212> TYPE: PRT 

348 <213> ORGANISM: Artificial Sequence 

350 <220> FEATURE: — 

351 <223> OTHER INFORMATION : peptide sequence) "SOS^^ £^ r0f 

353 <220> FEATURE: ^ 

354 <221> NAME/KEY: MI SC_FEATURE 

355 <222> LOCATION: (13).. (13) 

356 <223> OTHER INFORMATION: Xaa represents a Lysine residue derivatized with a maleimide 

moie 

357 ty. 

360 <400> SEQUENCE: 14 
W--> 362 Trp Glu Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Xaa Leu lie His 

363 15 10 15 

366 Glu Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Glu Asn Glu Gin Glu 

367 20 25 30 
370 Leu Leu 

374 <210> SEQ ID NO: 15 

375 <211> LENGTH: 35 
3 76 <212> TYPE: PRT 

377 <213> ORGANISM: Artificial Sequence 

379 <220> FEATURE: 

3 80 <223> OTHER INFORMATION : (Peptide sequence 

382 <220> FEATURE: - 

383 <221> NAME/KEY: MI SC_FEATURE 

384 <222> LOCATION: (35).. (35) 

385 <223> OTHER INFORMATION: Xaa represents a Lysine residue derivatized with a maleimide 

moie' 

386 ty. 

389 <400> SEQUENCE: 15 

3 91 Trp Glu Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Lys Leu lie His 
392 15 10 15 

3 95 Glu Leu lie Glu Glu Ser Gin Asn Gin Gin Glu Glu Asn Glu Gin Glu 
396 20 25 30 

W--> 399 Leu Leu Xaa 
400 35 
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RAW SEQUENCE LISTING ERROR SUMMARY DATE: 07/02/2004 

PATENT APPLICATION: US/10/667,966 TIME: 11:51:29 

Input Set : A:\Untitled.ST25.txt 

Output Set: N:\CRF4\07022004\J667966.raw 

please Note: 

{use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
/Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:8; Xaa Pos . 23 
Seq#:9; Xaa Pos. 4 5 
Seq#:10; Xaa Pos. 13 
Seq#:ll; Xaa Pos. 35 
Seq#:12; Xaa Pos. 13 
Seq#:13; Xaa Pos. 40 
Seq#:14; Xaa Pos. 13 
Seq#:15; Xaa Pos. 35 



file://C:\CRF4\OUTHOLD\VsrJ667966.htm 



7/2/04 



Page 7 of 8 



VERIFICATION SUMMARY DATE: 07/02/2004 

PATENT APPLICATION: US/10/667 , 966 TIME: 11:51:29 

Input Set : A:\Untitled.ST25.txt 

Output Set: N:\CRF4\07022004\J667966.raw 

L:10 M:270 C: Current Application Number differs, Replaced Current Application No 
L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
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